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Abstract — In this paper, we address the scenario where nodes 
with sensor data are connected in a tree network, and every 
node wants to compute a given symmetric Boolean function of 
the sensor data. We first consider the problem of computing 
a function of two nodes with integer measurements. We allow 
for block computation to enhance data fusion efficiency, and 
determine the minimum worst-case total number of bits to be 
exchanged to perform the desired computation. We establish 
lower bounds using fooling sets, and provide a novel scheme 
which attains the lower bounds, using information theoretic tools. 
For a class of functions called sum-threshold functions, this 
scheme is shown to be optimal. 

We then turn to tree networks and derive a lower bound for 
the number of bits exchanged on each link by viewing it as a 
two node problem. We show that the protocol of recursive in- 
network aggregation achieves this lower bound in the case of sum- 
threshold functions. Thus we have provided a communication and 
in-network computation strategy that is optimal for each link. All 
the results can be extended to the case of non-binary alphabets. 

In the case of general graphs, we present a cut-set lower bound, 
and an achievable scheme based on aggregation along trees. For 
complete graphs, the complexity of this scheme is no more than 
twice that of the optimal scheme. 

I. INTRODUCTION 

Wireless sensor networks are composed of nodes with 
sensing, wireless communication and computation capabilities. 
In sensor network applications, one is not interested in the raw 
sensor measurements, but only in computing some relevant 
function of the measurements. For example, one might want to 
compute the mean temperature for environmental monitoring, 
or the maximum temperature in fire alarm systems. Thus, 
we need to move away from a data-centric paradigm, and 
focus on efficient in-network computation and communication 
strategies which are function-aware. 

There are two possible architectures for sensor networks 
that one might consider. First, one can designate a single 
collector node which seeks to compute the function of interest. 
Alternately, one can suppose that every node in the network 
wants to compute the function. The latter architecture can be 
viewed as providing implicit feedback to each sensor node, 
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which could be useful in applications like fault monitoring 
and adaptive sensing. For example, sensor nodes could modify 
their sampling rate depending on the value of the function. In 
this paper, we will focus on strategies which achieve function 
computation with zero error for all sensor nodes. 

In this paper, we abstract out the medium access con- 
trol problem, and view the network as a graph with edges 
representing essentially noiseless wired links between nodes. 
Hence, we focus on strategies for combining information at 
intermediate nodes, and optimal codes for transmissions on 
each edge. This is a significant departure from the traditional 
decode and forward paradigm in wireless networks. Moreover, 
the strategy for computation may benefit from interactive 
information exchange between nodes. 

We consider a graph where each node has a Boolean vari- 
able and focus on the problem of symmetric Boolean function 
computation. We adopt a deterministic formulation of the 
problem of function computation, allowing zero error. We con- 
sider the problem of worst-case function computation, without 
imposing a probability distribution on the node measurements. 
Further, instead of restricting a strategy to compute just one 
instance of the problem, we allow for nodes to accumulate 
a block of B measurements, and realize greater efficiency by 
using block codes. The set of admissible strategies includes 
all interactive strategies, where a node may exchange several 
messages with other nodes, with node ;'s transmission being 
allowed to depend on all previous transmissions heard by node 
i, and node i's block of measurements. 

We begin with the two node problem in Section [TTH where 
each node i has an integer variable Xj and both nodes want to 
compute a function f(X\,X2) which only depends on X\ +X2. 
We use a lower bound from the theory of communication 
complexity, by constructing an appropriate fooling set HI. 
For achievability, we devise a single-round strategy so as 
to minimize the worst-case total number of bits exchanged 
under the Kraft inequality constraint. For the class of sum- 
threshold functions, which evaluate to 1 if X\ +X2 exceeds a 
threshold, this single-round strategy is indeed optimal. How- 
ever, for the class of sum-interval functions, which evaluate 
to 1 if a < X{ + X2 < b, the upper and lower bounds do not 
match. However, the achievable strategy involving separation, 
followed by coding, can be used for any general function. 

In Section IIV1 we consider Boolean symmetric function 



computation on trees. Since every edge is a cut-edge, we 
can obtain a cut-set lower bound for the number of bits that 
must be exchanged on an edge, by reducing it to a two node 
problem. For the class of sum-threshold functions, we are 
able to match the cut-set bound by constructing an achievable 
strategy that is reminiscent of message passing algorithms. 

In Section[V] for general graphs, we can still derive a cut-set 
lower bound by considering all partitions of the vertices. We 
also propose an achievable scheme that consists of activating 
a subtree of edges and using the optimal strategy for transmis- 
sions on the tree. While the upper and lower bounds do not 
match even for very simple functions, for complete graphs we 
show that aggregation along trees provides a 2-OPT solution. 

II. RELATED WORK 

The problem of function computation in sensor networks 
has received much attention recently. In 12, the problem 
of worst-case block function computation with zero error 
was formulated. The authors identify two classes of sym- 
metric functions namely type-sensitive functions exemplified 
by Mean, Median and Mode, and type-threshold functions, 
exemplified by Maximum and Minimum. The maximum rates 
for computation of type-sensitive and type-threshold functions 
in random planar networks are shown to be ©(j^) and 
®( iogiogn ) res P ect ively, where n is the number of nodes. For 
the case of a designated collector node interested in computing 
the function, one can derive a per-link optimal strategy for 
block function computation in tree networks Q. 

In contrast, in this paper, we require that every node must 
compute the function. The latter approach naturally allows the 
use of tools from communication complexity JTJ. The com- 
munication complexity of Boolean functions has been studied 
in flU. Further, one can consider the direct-sum problem J5] 
where several instances of the problem are considered together 
to obtain savings. This block computation approach is used to 
compute the exact complexity of the Boolean AND function 
in (6). This result was considerably generalized in Q to 
derive optimal strategies for computing symmetric Boolean 
functions in broadcast networks. The average and randomized 
complexity of Boolean functions are studied in 0. 

While we focus on worst-case computation in this paper, we 
could suppose that the measurements are drawn from some 
joint probability distribution. The problem of source coding 
with side information has been studied for the vanishing error 
case in |9|, and for the zero error case in iflOl . The problem of 
source coding for function computation with side information 
has been studied in IfTTI . However, a tractable information 
theoretic formulation of the problem of function computation 
has proved elusive. 

In this paper, we allow for all interactive strategies where 
each node's transmission is allowed to depend on all previous 
transmissions and the node's measurements. Thus, all network 
coding strategies lfl2l are subsumed in this class. The rate 
region for multi-round interactive function computation has 
been characterized for two nodes (fl~3), and for collocated 
networks fl4l . 



III. The two node problem 

Consider two nodes 1 and 2 with variables X\ £ 
{0,1,..., mi} and X2 £ {0,1,..., m^}. Both nodes wish to 
compute a function f(X\,X<)) which only depends on the 
value of X\ + X 2 . To put this in context, one can suppose 
there are m\ Boolean variables collocated at node 1 and mi 
Boolean variables at node 2, and both nodes wish to compute a 
symmetric Boolean function of the n := m\ +1112 variables. We 
pose the problem in a block computation setting, where each 
node i has a block of B independent measurements, denoted 
by Xf . We consider the class of all interactive strategies, 
where nodes 1 and 2 transmit messages alternately with the 
value of each subsequent message being allowed to depend 
on all previous transmissions, and the block of measurements 
available at the transmitting node. We define a round to 
include one transmission by each node. A strategy is said to 
achieve correct block computation if for every choice of input 
(Xf,Xf ), each node i can correctly decode the value of the 
function block f B {X\ ,^2) using the sequence of transmissions 
b\J)2,... and its own measurement block Xf. This is the 
direct-sum problem in communication complexity. 

Let S^b be the set of strategies for block length B, which 
achieve zero-error block computation, and let C(f,SB,B) be 
the worst-case total number of bits exchanged under strategy 
Sb £ oS^fl. The worst-case per-instance complexity of comput- 
ing a function f(Xi,X2) is defined as 

C(f,S B ,B) 



C(f) 



lim min 

B^oos B ey B 



B 



A. Complexity of sum-threshold functions 

In this paper, we are only interested in functions f(X\,X2) 
which only depend on X\ + X 2 . Let us suppose without loss 
of generality that m\ < ra 2 . We define an interesting class of 
{0, l}-valued functions called sum-threshold functions. 

Definition 1 (sum-threshold functions): A sum-threshold 
function Ylg(X\,X2) with threshold is defined as follows: 



n e (Xi,x 2 ) 



1 ifXi+X 2 >9, 
otherwise. 



For the special case where m\ = \,m% = \ and = 2, we 
recover the Boolean AND function, which was studied in j6). 
Throughout this paper, we will use tools introduced in |6|. 

Theorem 1: Given any strategy Sb for block computation 
of the function Hg(Xi,X2), 

C(n e (X 1 ,X 2 ),5 B ,B)>Slog 2 {min(20 + l,2m 1 +2,2(/t-0 + l) + l)}. 

Further, there exist single-round strategies S B and S B *, starting 
with nodes 1 and 2 respectively, which satisfy 

C(n e (Xi,X 2 ),S£,B) < rfilog 2 {min(2e + l,2mi + 2,2(n- + 1) + 1)}"|. 
C(n e (X h X 2 ),Ss*,B) < [Blog 2 {min(2e + l,2mi +2,2(n- 6 + 1) + 1)}]. 

Thus, the complexity of computing IIg(Xi,X 2 ) is given by 
C{Ue(X h X 2 )) = log 2 {min(20 + l,2mi +2,2(n - 0+ 1) + 1)}. 
Proof of achievability: We consider three cases: 
(a) Suppose < m\ < mi. We specify a strategy S* B in which 



node 1 transmits first. We begin by observing that inputs X\ = 
9,X\ = (9 + 1) . . . ,X\ =m\ need not be separated, since for 
each of these values of X\ , Ylg (Xi,X 2 ) = 1 for all values of X 2 . 
Thus node 1 has an effective alphabet of {0, 1 , . . . , 0}. Suppose 
node 1 transmits using a prefix-free codeword of length l(X B ). 
At the end of this transmission, node 2 only needs to indicate 
one bit for the instances of the block where X\ = O,1,...,(0 — 
1). Thus the worst-case total number of bits is 

L : = max(/(Xf ) + w°(Xf) + w\xf ) + . . . + w -\xf )), 
x* 

where w 7 (X B ) is the number of instances in the block where 
X\ = j. We are interested in finding the codebook which will 
result in the minimum worst-case number of bits. From the 
Kraft inequality for prefix-free codes we have 

£ 2 -L+«.°(xf)+ M , 1 (xf)+...+H.e-i(zf)) < L 

xfe{o,i 8} B 

Consider a codebook with l(Xf) = \B\og 2 (29 + 1)] - w(xf ). 
This satisfies the Kraft inequality since 

y 2 w°(Xf )+n- 1 (Xf) + ...+H- e -'(Xf)) A w e (Xf ) = ( 29 + 

xfe{o,i,..,e}B 

Hence there exists a prefix-free code for which the worst- 
case total number of bits exchanged is [Z?log 2 (20 + 1)] . Since 
9 < mi < m 2 , we have 

C(U e (X h X 2 ),S* B ,B) < rBlog 2 {min(2e + l,2m 1 +2,2(«-e + l) + l)}]. 

The strategy Sjjj* starting at node 2 can be similarly derived. 
Node 2 now has an effective alphabet of {0,1, ... ,9}, and we 
have C(U e (X u X 2 ),S B *,B) < \Blog 2 (29 + 1)]. 

(b) Suppose m\ < ni2 < 9. Consider a strategy Sg in which 
node 1 transmits first. The inputs X\ = Q,X\ = l,...,Xi = 
9 — Y112 — 1 need not be separated since for each of these values 
of X\, I\q{X\,X2) = for all values of X2- Thus node 1 has an 
effective alphabet of { 9 — m 2 — 1 , 9 — m 2 , . . . , m\ }. Upon hear- 
ing node l's transmission, node 2 only needs to indicate one 
bit for the instances of the block where X\ = 9 — m 2 ,...,mi. 
Consider a codebook with l(Xf) = [Blog 2 (2(mi + m 2 — 9 + 
1) + 1)1 - w 9 -" 12 (Xf ) - ... - w'" 1 (Xf ). This satisfies the Kraft 
inequality and we have L = \B log 2 (2(n — 9 + 1) + 1 )] . Since 
mi < ni2 < 9, we have that 

C(U g (X u X 2 ),S* B ,B) < fBlog 2 {min(26 + l,2 mi +2,2(n- 9 + 1) + 1)}"|. 

The strategy S^* starting at node 2 can be analogously derived. 

(c) Suppose mi < 9 < m 2 . For the case where node 1 transmits 
first, we construct a trivial strategy where node 1 uses a 
codeword of length [Z?log 2 (mi + 1)] bits and node 2 replies 
with a string of B bits indicating the function block. Thus we 
have C(U e {Xi,X 2 ),S* B ,B) < fBlog 2 (2mi +2)"|. 

Now consider a strategy Sg* where node 2 transmits first. 
Observe that the inputs X 2 = 0,X 2 = 1, . . . ,X 2 = 9 — mi — 1 
need not be separated since for each of these values of 
X 2 , ne(Xi,X 2 ) = for all values of X 2 . Further, the inputs 
X 2 = 0,X 2 = 9 + 1, . . . ,X 2 = /«2 need not be separated. Thus 



node 1 has an effective alphabet of {9 — mi — 1, 9 — mi , . . . , 9}. 
Upon hearing node 2's transmission, node 1 only needs 
to indicate one bit for the instances of the block where 
X2 = 9 — mi,... , 9 — 1, Consider a codebook with l(X^) = 



rBlog 2 (2/ni+2)l 



w 



w 



l (Xf). This satis- 



fies the Kraft inequality and we have L = |Blog 2 (2(/i — 9 + 
1) + 1)]. Since mi < 9 < 1112, we have that 

C(n e (X h X 2 ),Ss,B) < [Blog 2 {min(20 + l,2mi +2,2(n- 6 + 1) + 1)}]. 

The lower bound is shown by constructing a fooling set JT]. 

Definition 2 (Fooling Set): A set E C S£ x <3/ is said to be 
a fooling set, if for any two distinct elements (xi,yi),(x2,y 2 ) 
in E, we have either 

• f(xi,yi) ^ f(x 2 ,y2), or 

• f(xi,yi) =f(x 2 ,y 2 ), but either f{x u y 2 ) ^ f(x u yi) or 
f(x 2 ,yi)^ f{xi,yi). 

Given a fooling set E for a function /(Xi,X 2 ), we have 
C(/(Xi,X 2 ))>log 2 \E\. 

Proof of Lower Bound: Define the measurement matrix M to 
be the matrix obtained by stacking the row Xf over the row 
Xj 8 . Let E denote the set of all measurement matrices which 
are made up only of the column vectors from the set 



zi 

Z.2 



:Q<zx <mi,0<z 2 <m 2 ,{e-l)<zi+Z2 < 6 



We claim that E is the appropriate fooling set. Consider two 
distinct measurement matrices M\,M 2 £ E. Let f B (M\) and 
/ B (M 2 ) be the block function values obtained from these two 
matrices. If f B (M\) ^/ B (M 2 ), we are done. Let us suppose 
f B {M\) =/ B (M 2 ), and note that since Mi ^M 2 , there must 
exist one column where M\ and M 2 differ. Suppose M\ has 



Zla 



while M 2 has 



Z16 

Z2b 



where zi a +z 2a =Zi b +z 2 b- 
ity that zu < Z\b and z 2ll > Z2b- 



Assume without loss of genera 

• If zia+Z2a = Z\b +Z 2 b = 9 — 1, then the diagonal element 
f{zib,Z2a) = 1 since Zib+Z2a > 9. Thus, if we replace the 
first row of M\ with the first row of M 2 , the resulting mea- 
surement matrix, say M* , is such that f(M*) ^ /(Mi). 

• If Zi a +Z2a = Zib + Z21, = 9, then the diagonal element 
f{zia,Z2b) =0 since Zib+Zia < 6- Thus, if we replace 
the second row of Mi with the second row of M 2 , the 
resulting matrix M* is such that f(M*) ^ /(Mi). 

Thus, the set £ is a valid fooling set with cardinality \Z\ B . 
For any strategy Sb, we have C(/,Sb,B) >Z?log 2 |Z|. The 
cardinality of Z can be modeled as the sum of the coefficients 
of Y 6 and Y e ~ l in a carefully constructed polynomial: 



3-1 



(l+Y + ... + Y'"')(l+Y + ...+Y m2 ) 



This is solved using the binomial expansion for 



(l-Y)k 



1 151. 



(a) Suppose 9 < mi < m 2 . Then \Z\ = 9 + 9 + 1, 

(b) Suppose mi < 9 < m 2 . Then |Z| = 2mi +2. 

(c) Suppose m l <m 2 < 9. Then jzj = 2(n - 9 + 1) + 1. 
This completes the proof of Theorem Q] □ 



B. Complexity of sum-interval functions 

Definition 3 (sum-interval functions): A sum-interval func- 
tion Yl^ a b ^(Xi,X 2 ) on the interval [a,b] is defined as follows: 

f 1 ifa<Xi+X 2 <i, 

n M (Xi,x 2 ).= | Q otherwise 

Theorem 2: Given any strategy Sb for block computation 
of Il[ a ^](Xi,X2) where b < n/2, 

C{U [aM (X t ,X 2 ) , S B , B) > B log 2 {min(2fc - a + 3 , mi + 1 ) }. 

Further, there exists a single-round strategy S B which satisfies 

C(n [aM (X u X 2 ),Si,B)<\B\og 2 {mm(2(b+l) + l,2m l +2)}l 

Thus, we have obtained the complexity of computing 
Ylg(Xi,X 2 ) to within one bit. 

C. A general strategy for achievability 

The strategy for achievability used in Theorems [TJ and 
12 suggests an achievable scheme for any general function 
f{X\,X 2 ) of variables X\ G 3£\ and X 2 G 3£ 2 which depends 
only on the value of X\ + X 2 . This is done in two stages. 
Separation: Two inputs x\ a and x\b need not be separated 
if f{x\ a ,x 2 ) = f{x\biX 2 ) for all values x 2 . By checking this 
condition for each pair (xi a ,xib), we can arrive at a partition 
of {0,1..., mi} into equivalence classes, which can be con- 
sidered a reduced alphabet, say A := {a\,... 
Coding: Let Aq denote the subset of the alphabet A for 
which the function evaluates only to 0, irrespective of the 
value of X 2 , and let A\ denote the subset of A which always 
evaluates to 1. Clearly, from the equivalence class structure, 
we have |Ao| < 1 and \A\\ < 1. Using the Kraft inequality as 
in Theorems Q] and [2] we obtain a scheme S B with complexity 
log 2 (2/-|A |-|A 1 |). 

IV. Computing symmetric Boolean functions on 

TREE NETWORKS 

Consider a tree graph T = (V,E), with node set V = 
{0,1,..., n) and edge set E. Each node i has a Boolean 
variable Xi G {0,1}, and every node wants to compute a 
given symmetric Boolean function f(X\,X 2 , . . . ,X n ). Again, 
we allow for block computation and consider all strategies 
where nodes can transmit in any sequence with possible 
repititions, subject to: 

• On any edge e = (i,j), either node i transmits or node 
j transmits, or neither, and this is determined from the 
previous transmissions. 

• Node i's transmission can depend on the previous trans- 
missions and the measurement block Xf . 

For sum-threshold functions, we have a computation and 
communication strategy that is optimal for each link. 

Theorem 3: Consider a tree network where we want to 
compute the function Ylg(Xi, . ..,X n ). Let us focus on a single 
edge e = (i,j) whose removal disconnects the graph into com- 
ponents A e and V\A e , with \A e \ < \V\A e \. For any strategy 



Sb G <5^B, the number of bits exchanged along edge e = 
denoted by C e (Ylg(Xi, . . . ,X„),Sb,B), is lower bounded by 

C e (U e (X u . . . ,X n ),S B ,B) > Blog 2 {min(2e + l,2\A e \ + 2,2(« - 9 + 1) + 1)}. 

Further, there exists a strategy Sg such that for any edge e, 

C e (n e (X h ...,X„),S* B ,B) < [Blog 2 {min(2e + l,2|A c |+2,2(; 1 -e + l) + l)}]. 

The complexity of computing Tlg(X\, . . . ,X„) is given by 
C e (U e (Xi ,X n )) = log 2 {min(20 + l,2\A e \ + 2,2(n -6+1) + 1)}. 

Proof: Given a tree network T, every edge e is a cut edge. 
Consider an edge e whose removal creates components A e and 
V\A e , with \A e \ < |V\A e |. Now let us aggregate the nodes 
in A e and also those in V\A e , and view this as a problem 
with two nodes connected by edge e. Clearly the complexity 
of computing the function Ylg(XA e ,X v \^ e ) is a lower bound on 
the worst-case total number of bits that must be exchanged on 
edge e under any strategy Sb- Hence we obtain 

C e (Ug(X h X„), S B ,B) >Blog 2 {min(2e + l,2|A e |+2,2(n-e + l) + l)}. 

The achievable strategy S B is derived from the achievable 
strategy for the two node case in Theorem Q] While the 
transmissions back and forth along any edge will be exactly 
the same, we need to orchestrate these transmissions so that 
conditions of causality are maintained. Pick any node, say 
r, to be the root. This induces a partial order on the tree 
network. We start with each leaf in the network transmitting 
its codeword to the parent. Once a parent node obtains a 
codeword from each of its children, it has sufficient knowledge 
to disambiguate the letters of the effective alphabet of the 
subtree, and subsequently it transmits a codeword to its parent. 
Thus codewords are transmitted from child nodes to parent 
nodes until the root is reached. The root can then compute the 
value of the function and now sends the appropriate replies 
to its children. The children then compute the function and 
send appropriate replies, and so on. This sequential strategy 
depends critically on the fact that, in the two node problem, 
we derived optimal strategies starting from either node. For 
any edge e, the worst-case total number of bits exchanged is 
given by 

C e (n e (Z 1 ,...A),SS,B)<rBlog 2 {min(2e + l,2|A e |+2,2(n-e + l) + l)}l.D 

One can similarly derive an approximately optimal strategy 
for sum-interval functions, which we state here without proof. 

Theorem 4: Consider a tree network where we want to 
compute the function Yl^ a b](Xi, . . . ,X n ), with b < |. Let us 
focus on a single edge e = (i,j) whose removal disconnects 
the graph into components A e and V\A e , with \A e \ < |V\A e |. 
For any strategy Sb G 5?b, the number of bits exchanged along 
edge e = denoted by C e (f ,Sb,B) is lower bounded by 

C e {U [aM (X 1 ,...,X n ),S B ,B)>Blog 2 {mm{2b-a+3,\A e \ + l)}. 

Further there exists a strategy S B such that for any edge e, 

C e (n [aM (X u . ..,X„),S*b,B)< rBlog 2 {min(2(fc+ 1) + l,2\A e \ +2)}]. 



A. Extension to non-binary alphabets 

The extension to the case where each node draws measure- 
ments from a non-binary alphabet is immediate. Consider a 
tree network with n nodes where node i has a measurement 
Xj G {0,1,...,/,-— 1}. Suppose all nodes want to compute a 
given function which only depends on the value of Xj + X 2 + 
...+X n . We can define sum-threshold functions in analogous 
fashion and derive an optimal strategy for computation. 

Theorem 5: Consider a tree network where we want to 
compute a sum-threshold function, Ylg(Xi, . . . ,X„), of non- 
binary measurements. Let us focus on a single edge e whose 
removal disconnects the graph into components A e and V \A e . 
Let us define l Ae := Y,ieA h- Then the complexity of computing 
Yl e (X u ...,X n ) is given by 

C e (n fl (X 1 ,...,X„)) = log 2 {min(2e + l,2min(/ Af ,/ l/w ) + 2,2(/ v -e+l) + l)}. 

Theorem 2] also extends to the case of non-binary alphabets. 
V. Computing sum-threshold functions in general 

GRAPHS 

We now consider the computation of sum-threshold func- 
tions in general graphs where the alphabet is not restricted to 
be binary. A cut is defined to be a set of edges F C E which 
disconnect the network into two components Af and V\Ap. 

Lemma 1 ( Cut-set bound): Consider a general network 
G = (V,E), where node i has measurement Xj 6 {0, 1 , . . . , /, — 
1 } and all nodes want to compute the function Tig (Xi, . . . ,X„). 
Given a cut F which separates Af from V\Af, the cut-set 
lower bound specifies that: For any strategy Sb, the number 
of bits exchanged on the edges in F is lower bounded by 

C F (n e (X l ,...,X„),S B ,B) >BIog 2 (min{2e + l,2m F + 2,2(/ v -e + l) + l)}. 

where l Af = £ ieAF h and m F = min(/ Af , l v \ Af ). 

A natural achievable strategy is to pick a spanning subtree of 
edges and use the optimal strategy on this subtree. The convex 
hull of the rate vectors of the subtree aggregation schemes, is 
an achievable region. We wish to compare this with the cut- 
set region. To simplify matters, consider a complete graph 
G where each node i has a measurement X, e {0, 1 }. 
Let R ac f, be the maximum symmetric ratepoint achievable by 
aggregating along trees, and R cut be the minimum symmetric 
ratepoint that satisfies the cut-set constraints. 

Theorem 6: For the computation of sum-threshold func- 
tions on complete graphs, R ac f, < 2(1 — -))R cu t- In fact, this 
approximation ratio is tight. 

Proof: Let us assume without loss of generality that < ^j-. 
Consider all cuts of the type ({«'}, V \ {z}). This yields 

/ mm (log 2 (20 + 1), log 2 (2Z,- + 2) ) \ 

K C ut max . 

iev \ n — l J 

Now consider the achievable scheme which employs each of 
the « star graphs for equal sized sub-blocks of measurements. 
The rate on edge is given by 

- (min(log 2 (29 + 1), log 2 (2/,- + 2) ) + min(log 2 (29 + 1 ) , log 2 (21 j + 2) )) 



Hence we have 

Rach < -(min(log 2 (20 + l),max{log 2 (2/, + 2)})) < 2 ( 1 - - ) R cut . 



VI. Concluding Remarks 

In this paper, we have addressed the computation of sym- 
metric Boolean functions in tree networks, where all nodes 
want to compute the function. Toward this objective, we 
derived lower bounds on the number of bits that must be 
exchanged on each edge, using communication complexity 
theory. Further, for each edge, we devise an achievable scheme 
for block computation that involves separation followed by 
prefix-free coding. We then sequence the transmissions so that 
information flows up the tree to a root node and then back 
down to the leaves. For the case of sum-threshold functions, 
our resulting achievable scheme is optimal. 

The approach presented also provides lower and upper 
bounds for the complexity of other functions like sum-interval 
functions. Our framework can be generalized to handle func- 
tions of integer measurements which only depend on the sum 
of the measurements. The extension to general graphs is very 
interesting and appears significantly harder. However, a cut-set 
lower bound can be immediately derived, and in some special 
cases we can show that subtree aggregation schemes provide 
a 2-OPT solution. 
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